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§1: Introduction. 

Arnold's diffusion has been established for the simple example proposed by Arnold, 
Q^l [A], following a nonvariational method, [CG], and variational methods, [Be], [Br]. The 

^ • nonvariational method yields estimates that are terribly big; the variational method 

I instead gives better estimates, "/asf ' , ( [Be] ) and even very good, "polynomial", ones 

a^ : ([Br])- 

Here I illustrate the method of [CG] by developing it with the aim of showing the 
existence of diffusion, without actually constructing time scales bounds on the diffusing 
trajectories. This may lead to a clarification of a method which maintains its interest in 
spite of the better estimates coming from variational methods because it is the only one 
, which, so far, is robust enough to apply to anisochronous systems. 

0^ ' If explicit estimates are avoided one gains enormously in simplicity: this kind of approach 

, was probably the one meant in [A] where the problem was first posed and solved without 

bothering to give the (fairly obvious, see §5) details. What follows applies also to the 
Arnold's case, but I prefer to illustrate it in a case that is even simpler. 
^ ' Furthermore I show that if a new idea is added to the method of [CG] then one can get 

*^ a "fast" (still exponential) estimate for the drift time. 

Q ' Here we consider hamiltonians H with three degrees of freedom described by coordinates 

! I e R, A= (Ai, A2) e i?2 and angles (peT^,a= (ai, a2) G T^: 



- 

X 



/2 

^- A+ — +g\cos^-l)+ef{^,a) (1.1) 



where lo — [uji,uj2) G i? is a vector with diophantine constants C, t, i.e. such that 
^ i for all integer components vectors v — {vi,i'2) it is [tj • ^ C\v_y 'il v_ / 0; 

the perturbation f is supposed to be a (fixed) trigonometric polynomial of degree N: 
fi^, fi) = J2o<\^\<N,\n\<N f";E cos(n(/3 + E ■ a). The subject being fairly weU under- 
stood we do not need to be really very careful about units so that some coefficients in 
(1.1) have been set equal to 1. 

One can also use the well known Jacobi's hyperbolic coordinates po^qo to describe the 
pendulum + g'^{cos^p — 1) near the unstable point I = ip = 0, see appendix Al. In 
the new coordinates, which we denote poi 90: the pendulum hamiltonian becomes J(po'Zo) 
with J'{x) = 5 + X^J^i 5"^" 9{^) s-iid the total hamiltonian becomes: 

= LH ■ A^ + J{paqo) + £/o(a ,Po,go) (1-2) 
where Aq= A, = and /o(a ,Po, go) = /(^P, a). 

The function /o, still a trigonometric polynomial in a, has the property: fo{a,p,q) = 
fo{— a,q,p) = fo{—a, —p, —q) and we shall call parity the 4-elements group of transfor- 
mations generated by Pi : {a,p,q) < — > {—a,q,p) and P2 : ia,p,q) < — > {~ a, —p, —q). 
We call Pq I -Pi 7 P the group elements; we say that /o has "even parity" . li F{a,p,q) — 
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-~F{Pj{a,q,p)) — PjF{a,q,p), j = 1,2, we say that F has odd parity: for instance 
da fo has "odd parity" . The p derivative F or the q derivative G of an even function have 
the property F = PiG = —P2F, G = PiF = —P2G. The Jacobi's map in general trans- 
forms functions oi (p, a with given parity in a,y> in the ordinary sense into functions 
with the same parity in the {a,p,q) variables. 

§2. Invariant tori and nearby flow. 

We look for a change of coordinates {Aq, aQ,pa,qo) < — » {A, tp,p,q) which integrates 
locally (1.2) near the unstable equilibrium of the pendulum. More precisely so that in 
the new coordinates the motion is: 

A = const, + uit, p^ q ge(i+'^)»* (2.1) 

where g = g{pq), 7 = "iipq) and x —>■ 7(2;) is a suitable fimction analytic in x near x = 
while g{x) = J'{x) (sec (1.2). We shall attempt to write the change of coordinates: 

Ao=A + H{ip,p,q), po=p + L{'tp,p,q) 

~ - ~def (2-2) 

a=±, qo = q + L{'ip,p,q) = q + L{-'tj^,q,p) 

where if has zero average and even parity. Setting qdq —pdp and imposing that 
(2.2) and (2.1) verify the equations of motion one gets the equations: 

{g{x)0+ w -d^) H = - edafo{±,Po,qo) - 9{x)^{x)0 H_ 
{9{x)+g{x)0 + LO ■ d^) L = - edg„fo{'>p,po,qo) - {g{xo) - g{x))po+ (2.3) 

+ 'y{x)g{x)p - g{x)'y{x) 0L 

where x = pq, xq = po9o, Po = P + qo = q + L. In fact the second equation is 
independent on the first. Both can be shown to admit, for e small enough as we shall 
always suppose below, a solution analytic in s and divisible by s. Note that the unknown 
are I£,L,j. A proof is essentially in the basic paper [Ge]: it follows the Eliasson's 
method, [E], as developed in [G2], [GG]. A "classical" {i.e. by quadratic iterations) proof 
can be derived from §5 in [CG] where the harder anisochronous case is detailed. 

§3. Stable and unstable manifolds. 

From (2.2) we can read the following facts: 

(1) Phase space contains a family of invariant tori T{A) parameterized by A G i?^ and 
obtained by setting p = q = 0. The average position of the tori is precisely A , because 
H has zero average: average with respect to ij) or to time. This is a general property 
of Thirring's models (Uke (1.1), see [T] and [G2y) "twistless tori". 

(2) Given A and setting q — 0, p =/= one obtains a surface whose points are param- 
eterized by tp ,p and which is a local piece of the stable manifold ^{A) of T{A). 
The quantity —(1 + ^{x))g{x) is the "Lyapunov exponent" of W^{A). Likewise, set- 
ting p = 0, g ^ one defines a local piece of the unstable manifold W'^{A) of A and 
(1 + "f{x))g{x) is the corresponding exponent. 

(3) Given A and setting q ^ 0,p ^ one parameterizes the rest of phase space near 
T{A). In this part of phase space the motion is in some sense very regular. 

(4) However the motions, just described locally, are globally more interesting and chaotic. 
In fact generically W""{A), W^{A') do intersect transversally if A, A' are close enough 
(depending on e) and T{A),T{A') have the same energy. In such cases W"^{A) n 
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W^{A') consists of trajectories, or heteroclinic intersections, running asymptotically 
around T{A) as t ^ —oo and around T{A') as t ^ +00. 

The symmetry of the problem implies, see [CG], that if A — A' and (p = w then 
the point of W''^{A) with (fi = tt. a = is homoclinic, i.e. it is on the trajectory 
W^{A)C^W''{A). The intersection W{A)f^W'{A') exists for a\\ A, A' close enough 
and is generically transversal in the sense that if we fix 1^ = tt (or ip to any other value 
7^ 0, 27r) then any pair of tangents to W^{A) and W^{A') at common points form an 
angle > /x > 0; the bound jj, depends on e, of course, and it is generically proportional 
to e. 

(5) Finally a definition: Let Aq, A^, . . . , Aj^ be a sequence such that \Aj — Aj_^_i\ is 
so small that W^{Aj) n W^{Aj_^_-^) have a transversal heteroclinic intersection, in the 
above sense, with intersection angles > /i at </? = tt . We call such a chain a heteroclinic 
chain or ladder. One finds in various simple examples jj, = 0{e) = 0{N'~^), see [CG]: for 
a general theory of the splitting see [GGM] and appended references. 

We shall prove the following theorem ("Arnold's diffusion" or "drift"): 

Theorem 1: Let Aq, Ai, . . . , Aj^ be a heteroclinic chain: for any 6 > there are 
trajectories starting within 6 of T{Aq) and arriving after a finite time T within 6 of 

This theorem is proved in [CG]; I prove it here againg along the lines of [CG]: the purpose 
being of showing the conceptual difi^erence with respect to the variational approaches, 
which accounts for the impressive diflference in the time scale of T compared with [Br] 
or with the estimate in theorem 2 below (see (6.5)). 

§4. Geometric concepts. 

Let 2 K > be smaller than the radius of the disk in the (p, q) plane where the functions 
in (2.2) are defined. We call k a "target parameter". 

To visualize the geometry of the problem involving 2-dimensional tori and their 3- 
dimensional stable and unstable manifolds, in the 5-dimensional energy surface, we shall 
need the following geometric objects: 

• (a) a point Xi, heteroclinic between T{A^) and T{A^_^_i), which has local coordinates, 
see {2.2),Xi = {Ai,^.,0,K). 

• (b) the equations, at fixed g = k, of the connected part of W^{A^_^-^) containing X,, in 
the local coordinates near T{A^); they will be written as: 

^i(i) = (A:+i(i),^,rf+i(i):«) (4-1) 

with \ ip — tp .\ < ^ foY some C > (i-independent): it is A^_^-^^{'ip .) = A^, p|_,_j(^ ■) = 
because we require Yi{ip .) = X^. There are constants F',F such that |Ai+i(V') ~ 
i4i+i( j)| and max|^_^ \= fixed bf+i(V')| are boimded, for C small enough, below by 
F'\\p — J and above hy F\ip — tp the constants F', F have size 0(/x). 
Note that {A^j^-^) also contains a part with local equations {A^^i, tp,p,0) which 
is not to be confused with the previous one described by the function Yi ( ijj). This is 
more easily understood by looking at the meaning of the above objects in the original 
{A, a,I,ip) coordinates: in a way the first part of W^{A^_^_i) is close to (p = and the 
second to p ^ 2-k. They can be close because of the periodicity, but they are conceptually 
quite different. 

•(c) a point Pj = Yi ( V' ) with | ^ . — ^ . | = r, , where .,ri will be determined recursively, 
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and a neighborhood Bi: 

Bi = {\A- <P^, \t- ti\ <Pi^ \pt+i{±)-p\ <P^, q = K} (4.2) 

where pi < r, is another length to be determined recursively. If ^, 2g are a lower and upper 
bound to {1 + ^{x))g{x) for \x\ < 4k^, the point Pi evolves in a time % ~ g^^ logK"-^ 
into a point X- near T(A^_,_j) which has local coordinates X- = ip'.,K,0). 

•(d) The points ^ of the set Bi are mapped by the time evolution to points that, at the 

beginning at least, come close to T( A^^i) and in a time t(^) acquire local coordinates 
near T{A^_^_i) with p = k exactly: the time t(^) is of the order oig^^ logK"""^. 
If St is the time evolution flow for the system (1.1) we write = Sr{^)£, (note that S 
depends also on i). Then S maps the set Bi into a set SBi containing: 

B'i = {\A- A,+,\<^p„ \t-t'^\<^Pi, P = K, \q\<^Pi} (4.3) 

because all the points in Bi with A = :di+i(V')) P = p|+i(V')) q = n evolve to points 
with A = A^_^_l, p = k, q = and close to ip'., by the definitions. Here S is a bound 
on the jacobian matrix of S (which, being essentially a flow over a time 0{'g~^ logK~^), 
has derivatives bounded i-independently: since we suppose that e is "small enough" we 
could take S = 1 + 6e for some 6 > if, as often the case, \A^ — A^_^_i\ < 0{e)). 

§5. The [CG]-method of proof of the theorem. 

Consider the points Yf+i ( V' ) € ( ii i+2 ) with coordinates ( A 'l_^_2 ( V' ) > ip , Pi+2 ( V' ) j '^) • 
They will evolve backwards in time so that A stays constant, tp evolves quasipcriodically 
hence "rigidly", and p|_|_2 (''/') evolves to k while the g-coordinate evolves from k to 
q = pf+2(V') (because pq stays constant, see (2.1)). The time for this evolution is 

~5-ilogAv|p|+2(i)r^-XT^:S ^ + 

— i iii+i 

Therefore there is a sequence V" such that V" ~^ ^i+i' Pl+2(V'") ^ 

4j+2(V'") liii+i O'lT'd — u}_T.^n tp'., as a consequence of the diophan- 

def 

tine properties of w. So that there is = V'" with n large enough and a point 

Pi+i = iAt+2{ti+i)' t^+vPi+2it,+l)'|^) e W'iA^^^) (actually inflnitely many) 
which evolves, backwards in time, from Pj+i to a point of B'^. 

Hence we can define r^+i — | — V'^_,_]^l and pi+i small enough so that the backward 
motion of the points in Bj+i enters in due time into B'^. It follows that the set Bi evolves 
in time so that all the points of S^+i are on trajectories of points of Bi. Hence all points 
of Bj^f wil be reached by points starting in Bq. 

This completes the proof. All constants can be computed explicitly, even though this 

is somewhat long and cumbersome, sec [CG]. The result is an extremely large diffusion 
time T (namely the value at TV of a composition of TV exponentials! at least this is the 
estimate I get after correcting an error in §8 of [CG]: the error is minor but leads to 
substantially worse boimds). 

Nevertheless the estimate that comes out of the above scheme seems essentially optimal. 
And then the problem is: "how is it possible that by other methods {e.g. variational 

methods of [Be], [Br]) one can get far better estimates? 

A reason may be that the variational methods are less constructive: less so than the 
above. The "fast drifting" trajectory exists but there seems to be no algorithm to de- 
termine it, not even the sequence of its "close encounters" with the invariant tori that 
generates drift: which is in fact preassigned in the above method. This certainly can 
account for a difference in the estimates. In fact the above construction is far too rigid: 
we pretend not only that drift takes place but also that it takes place via a path that 
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visits closely a prescribed sequence of tori in an essentially predetermined way. In §6 a 
less constructive method is proposed and used to obtain bounds: which, however, are 
still far from polynomial. 
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§6. Fast diffusion: elastic heteroclinic chains. 

The following adds a new idea to the method of [CG], exposed in §5, allowing us to 
improve the superexponential estimate of [CG] . Below e will be fixed small enough, and 
'g will be a lower bound to g{x){l + 7(2;)), see (2.1). 

Let y ^ A (y), 2/ e [0, 1], A' {y) '^if ^ ^ be such that the tori T{A{y)) have fixed 
energy. Then (evaluating the energy at the homoclinic point a = , <y9 = tt) one sees 
that 01 ■ A{y) is constant so that the line y ^ A{y) is parallel to = {u)2, — wi). 

Define y ^ A (y), y £ [0, 1], to be a elastic heteroclinic chain with flexibility parameters 
/3, 1? > and splitting /z if: 

(i) for a\l\y—y'\ < 'dfi there is a heteroclinic intersection between the stable and unstable 
manifolds of T{A{y)) and T{A{y')) with splitting angles > /x at <^ = tt. 

def 

(ii) the intersection matrix D = jiDo at = tt, a = verifies: 

C J- 7-1-1 ±\^''^ /3^n ^_def Ul^ (a -i \ 

{w -D^'w^) = [i^Q, = -. — I (6.1) 

\^\ 

where D,Do are y-independent (because of isochrony). 

(iii) a heteroclinic intersection at y = tt between W^*(i4(y)) and W^{A{y + S)) takes 
place at a^iS) = D-^ w-^\A'\^' + 0{^'^) for 5 = ^i?', \^'\ < i?, \A'\ = \A'{y)\ and: 

l\A'\PW\ < |(«,(<5') - a,(<5")) • < 2\A'\P\^\ (6.2) 

for all S' = iJi^', 5" = fi'&" and < 1? with 1? = 1?' - 1?". 

Remarks: 

(a) thus every sequence yo,y\, . . . ,yj\f with |t/j — j/j+i | < i^/x is a heteroclinic chain in the 
sense of §3, and the theorem proved in §5 applies to it. A elastic heteroclinic chain with 
parameter iD is also elastic with parameter 1?' < 1?. Hence it is not restrictive to suppose 
that 1? is as small as needed. 

(b) condition (6.1) is a transversality property while (6.2) is just saying that 1? is so small 
that the involved first order Taylor's expansions are "good" approximations (hence it is 
a weak condition and it follows from (ii) provided d is small enough). The geometrical 
meaning of (6.1), (6.2) is that when y varies by i5 (so that A{y) varies in orthogo- 
nally to L0_ by 0{5)), then the heteroclinic intersection ay{5) between W^{A{y)) and 
W^{ A{y + 5)) is away from in the direction orthogonal to w by 0(5ii~^) provided 
ifjL^^ = d' is small enough. 

The same remains true if one looks at the displacement of the heteroclinic intersection 
at any other section located away from the tori by a fixed distance k > 0, if e is small 
enough. In fact consider the intersection matrix D{t) evaluated along the heteroclinic 
trajectory at a time t after the passage through (y9 = tt. From the equations of motion its 
evolution is D{t) = D — j]^ da^^,f{(p{t), LOT)da A(r) dr where A(t) denotes the splitting 
in the (p- coordinates and ip{t) the heteroclinic evolution of i.e. D{t) — D + 0{£^) 
(while D = 0{e)), sec (5.5) in [GGM]. 

In particular if we look at the heteroclinic intersection point ip ^{S) at q = k, on the 
same heteroclinic trajectory, and compare it with the position of the homoclinic point 
V' ^(0) of T{A{y)) a,t q = K then we can say that, for some constants 2bi, 2bo (the factor 

2 is just convenient) it is \w-^ ■ {ii^{5') - ■ip_^{S"))\ e [26it?, 26oi9] , with 1? = {S' -S")iJ,-^. 
The constants &o, &i depend on k, which is however prefixed, and on /3. 

(c) examples exist; and generically fj, = 0{e). 

(d) the above definition is a special case of a natural more general definition rele- 
vant for higher dimensions and for anisochronous systems. For instance in the case 
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of anisochronoiis systems in which a term A"^ /'i.J ^ with J > constant, is added to (f .1) 
one has to require that y — *■ A{y) \& & simple rectifiable curve and that, uniformly in 
y e [0, 1], (6.1) holds with D replaced by the intersection matrix Dy, and w replaced by 
w (A (y)) = + A{y)J^^ . In higher dimensions one replaces (6.1) by requiring that the 
determinant of the matrix PyD~^Py, with the projection on the plane orthogonal 
to ijl{ A{y)), be non zero uniformly in y. But in the anisochronous cases the condition 
that for all y there is the torus T{A{y)), called "no gap condition", is strongly restrictive 
and quite artificial (although it is verified in the example in [A]), 
(e) One can free completely what follows from the parameter e and discuss everything 
in terms of ^ only. For simplicity we leave the general formulation to the reader and stay 
with £ small enough. Also for simplicity we shall take \A'\ — 1, as this is not restrictive. 

Theorem 2: Suppose that y ^ A{ii) is elastic in the above sense, then fixed a,b 
there exist heteroclinic chains Aq = A{yo), A^ = A{yi), . . . , Aj^ = A{yj^) with 
yo = a, yj\f = b along which the drift time is 0(/U~^). 

The estimates proceed by performing the construction of §5 without fixing a priori the 
heteroclinic chain: wc construct it inductively. 

Using the notations of §4 assume that yj have been constructed for j < i+1 together with 
tp .,rj,pj,Bj,B', for j < i. We must define yi+2, ^ ■ , , , ri+i,Pi+i and, as a consequence, 
-Bi+i, -B^+i- 

Let E be as in §5 and let E' be so large that if T° = log E'E~^ the points wt, 
t e [0,T°], fill the torus within ^bid (see (b) above for the definition of bi). This means 
that E' is very big: E' = cxpO{C'&~'^~^) if one uses the estimate that the time needed 
to a quasi periodic rotation of the torus with vector w , diophantine with constants C, r, 
to fill within 5 the whole torus is 0{C6~^~^) (for completeness see appendix A2). 

Let Xi+i{y) bo heteroclinic between T{A^_^_^) and T{A{y)) for y e [yi+i + ^M^i Vi+i + 
IJ,i}]. Let the local coordinates of Xj+i(i/) be (Aj_,_^, ip ._^^{y),0, k) (see §4 for the nota- 
tions). Let, see (4.1): 

Yi+l,y{t) = Ut+2,y{t)^ t,Pt+2,y{t),'^) (6-3) 

be the equation of {A{y)) in the local coordinates around the torus T{A^_f_i) near 
Xi+iiy). We may suppose that | A|+2,y(V^) - i:i+iMp|+2,j/( V^)! < hlilt - ti+iiy)l 
for some 63 of 0(1) and we may suppose 63 > 1, for simplicity. 

Suppose r small: a first approximation to V' will be a point '4' ^_^_iy^ at distance r 
from tp (y) such that: 

A \Ph2,y{U,,J = \ )|^, \pU.,y{±)\ (6.4) 

and A G [62/xr, 63^r], for some 62 = 0(1) > 0, by the assumption on the splitting. The 
constants 62, &3 are large or small with the "target" parameter k, fixed once and for all, 
see beginning of §4). Let d — 62/^3- Note that tp is defined non constructively. 

As r varies in the range djj^^ < r < the point ^ ^ ^ varies and A varies by a fac- 
tor not smaller than E' / E by our definition of d. Hence the time T{r) necessary in order 
that the backward evolution of the point li+i „( . , ) = ( ^,^4-9 „('/'■ , ), i/' , 
Pi+2,yi'^ y r)' interchanges the last two coordinates will vary by an amount >T^ = 
0{const i')^'^^^) , sec the first lines of this proof and (2.1). 

This implies, by continuity, that there will be a value r{y) such that the backward 
motion of duration T(r(y)) of Y^+i,y{ti+2,y,riy)^ V^-coordinate ti^^^^f^^^^y close 
within ^bi'd + to the coordinate V • of the point X[ = [A^j^^^, iI)'.,k,0) (around 
which the already known set B'^ is constructed (see (4.3)); i.e. closer than (as we 
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suppose that < assuming that pi < pi and pi is small enough, which will turn 
out to be not restrictive). We can even obtain that i) . , , is on a chosen side of the 
line through tjj '. parallel to u . 

Note that this is just a continuity statement: hence it is non constructive; furthermore 
other continuity statements will follow. In this sense the analysis is quite close in spirit 
to the variational approaches: nothing is really constructive. 

We now vary y G [yi+i + ^fi^,yi+i + the point ■_^_^{y) varies in the direction 
orthogonal to u by 611? at least (see remark (b) and condition (6.1)). 

This means that there is y*. in the considered interval, for which ih . , , having set 

r* = r{y*), is on the line parallel to u off ijj^ and within a distance bii}^ of it. 
Fixed y*,r*, let ip rotate on the circle of radius r* around By the definition 

of p|+2,y* ( V' ) its modulus A will vary at least by a factor 2 with respect to its value A* 
at V' ^. J see (6.4), (in fact it will vary between 2A* and 0, extremes included). 

Hence by suitably adjusting ip on the circle we can find a point tp such that the point 
Yi^i^y* = ( A|_|_2 j^. (■i/'^_(_-^)) i_^ijPi+2,y'i'^ evolving backwards in time exchanges 

the p,q coordinates in a time Tj+i ~ T{r*), and il^._^_^y^ ^, — wTi+i = tp'^ (below we 
shall worry about the difference li .,, , , ^ ■ . ■,)■ 

•' -!-i+l,y- ,r' ' 2-1+1' 

In fact in order to obtain this we only have to change the time T{r*) by an amount 
0(6i?9), and this is achieved by varying A off A* by a factor e'^^^^'') G [51 2], as it is not 
restrictive to take bi'd small. 

Since ^ . , differs from ip . , by a small amount 2r* < (at most, by construc- 
tion), this means that the point ^ — wTi+i differs from tp'. by at most recalling 

that 63 > 1. Since r* < the other coordinates verify bi+i.j,.( "0-^-^)1 < bs^^bts ^^'-^ 

< ^3/i2|fB (rccallthat At+^.y-it^+^iv*)) = A^+l)■ Hence the 
point Yi+\^y* evolves, backward, in time Tj+i to a point well inside B[: so do, in a time 
which differs suitably from Tj+i by a factor of 0(1), all the points close enough to it, say 

within Pi+i = r*/2. 

Hence if we set r,+i = r* > and pi+i = rj+ij^^y^, yi+2 = y* and A^_^_2 = 

A (yi+2) we see that all points of Bj+i are on the forward evolution of points in The 
time needed for the passage through -B^+i of the points of B'^, which visit it, is bounded 
proportionally to Tj+i . The radius pi has to be chosen so small so that the ratio between 
the variation of the time to exchange p and q is small enough {i.e. 0{^)). 

It follows that within a time T = const X]i=o the whole chain will be run by some 
trajectories. Hero TV > 2(6 - a){i}p)-^ and T, < 0{g-^ \og{E'y), so that drift between 
-A (a) and A{b) takes place in a time 0(5 "^2-'^): 

T < const 09-^2^"''-^'" (6.5) 
and, recalling that 1? is fixed, if = 0(e) this is const 2^°"**^ \ 
§7. Concluding remarks. Very fast diffusion? 

For a review on diffusion see [L] : in this paper the possibility of estimates of size of an 
inverse power of e is proposed and discussed. 

(1) The above nonvariational proof gives results not directly comparable to the best 
known, [Be], [Br], based on a variational method and giving (in [Br]) a polynomial drift 
time of 0{ii~^). 

The papers [Be], [Br], deal with Arnold's example, [A], i.e. with a different case. They use 
in an essential way the structure of the model, implying existence of a "gapless" system 
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of local coordinates in which the motion is "trivial" {i.e. given by (2.1)). Althongh such 
coordinate system does not appear explicitly in the proofs in [Br], it nevertheless exists 
under Arnold's assumptions as shown by [P]. 

Therefore this difference between the present paper and [Br] is not so important: the 
above proofs apply also to the model in [A] and [Br] (in fact in absence of gaps also 
anisochronous systems admit coordinates {A, ip ,p,q) with the properties of the ones in 
§2). Only the constants may be affected (in particular the ones in (6.5)), although I do 
not think that (6.5) changes. It is hard to see how to improve the bounds of §6, which 
are already quite non constructive. Hence the difference between the size of the bounds 
remains a puzzle that I do not understand. 

(2) It is worth stressing that the above methods apply every time there is a heteroclinic 
chain and "no gaps" around resonant tori: therefore they apply to the case in [A] with, 
in the notations of [A], /i = and c large enough. 

In the isochronous models they apply, immediately, to a variety of cases: a non trivial 
one is the hamiltonian (1.1) with lj = (??", r?"^/^), a > 0, e = iirj" with c large enough 
and, possibly, even a further " m,onochrorn,atic, strong and rapid^ perturbation /3/o((/3, A) 
like /3cos(A + ip) with /3 = 0(1). Suppose that we consider only values of ry such that 
\uj_- v\> Cr]'^\v\-'' for &\\ v <^ Z"^, see §2 in [GGM]. Then hy using the results of 
[GGM] (§8) we see that if rj is fixed small enough the homoclinic splitting is analytic in 
(3 for \j3\ < 0{r]^^^'^), while it does not vanish for /? small {i.e. (3 = 0{r]'^)), generically 
in / (but it is very small, see [GGM], §6). Hence it is not for all /3 < 2 (say) except 
possibly finitely many values of (3. This means that in such strongly perturbed systems 
{(3 = 0(1)) one still has elastic heteroclinic chains of arbitrary length, see §8 of [GGM], 
and therefore there is diffusion (provable by the methods of §5, §6). Furthermore the 
A -independent (because of isochrony, see [GGM]) homoclinic angles can become large 
when P, jj, approach their convergence radii and this gives us the possibility of "very fast" 
drift on time scales of ~ 0(1). In fact I think that the homoclinic splitting might be a 
monotonic function of e. 3 for interesting classes of perturbations. 

A similar analysis can be made for the model in [A] (which is also without gaps). 

(3) An advantage of the technique of §5 is its flexibility which makes it immediately 
applicable, essentially without change, to anisochronous systems, see [CG]. 

(4) Constructivity, even partial (see comments in §5), seems the key to understanding 
the huge difference between the results of §5 and the variational results, or those of 
§6 above: diffusion time bounds in an inverse power of e (in [Br] and §6) versus an 
exponential in the more constructive proposal in §5. A hint in this direction is provided 
by the bound in §6: by adding a new idea to the method of §5, i.e. of [CG], one can get a 
drift time estimate of 0{e^'^) instead of the exponential of [CG], and §5. But the theory 
becomes now less constructive: not even the sequence of close encounters with invariant 
tori is determined. 

(5) It seems possible that the construction of §6 might be rendered constructive without 
losing the bounds: this is certainly an interesting problem. 

(6) Finally we discussed only drift in phase space: but it is clear that heteroclinic 

chains do not need to "advance" at each step {e.g. a A-coordinatc needs not to increase 
systematically): we can use heteroclinic chains that advance and back up at our prefixed 
wish {e.g. randomly). Hence, in this sense, there is no difference between drift and 
diffusion. 
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ticular, to G. Gentile and V. Mastropietro for many discussions and help in revising the 
manuscript. This work is part of the research program of the European Network on: 
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Appendix Al. Jacobi's map. 

This appendix is standard: here it is taken from A9 of [CG] with small changes, to use 
it for future references. 

The theory of jacobian elliptic functions shows how to perform a complete calculation of 
the functions, below denoted R, S, in terms of which the canonical Jacobi's coordinates 
are defined, see [GR] (9. 198), (9. 153), (9.146), (9.128), (9.197). The result, reported for 
completeness, is discussed in terms of the pendulum energy: 



{Al.l) 



where the origin in (p is set at the stable equilibrium, to adhere to the notations in the 
theory of elliptic functions. Setting u = t(iiJ/2)^/^ = e^^'^gt, k'^ = Ig^ jE = where e 
is the dimensionless energy so that £ = 1 is the separatrix, let: 



K{k) 



7r/2 



da 



(l-fc2sin2a)i/2 



(A1.2) 



We shall use the "standard" notations {i.e. those in [GR]) for the jacobian elliptic inte- 
grals except for x{.), which is usually denoted but which we would confuse with the 
variable q that we want to construct: 



k' =(1 - A;2)V2, 



9j 



e'/'g, 



ll-fcl/2 



'2fcK(fc') ^ '■~21 + A;V2 

x{k') =e-'^K(fc)/K(fc') ^ ;^ ^ + 15X^ + 150A^^ + ITOTA^^ + . . . 



(A1.3) 



In terms of the above notations we have, directly from the definitions (i.e. from the 
equations of motion): 



I{t) = (p= -2.ge^/2 dn {u, k), (p{t) = 2 am (tge^^^) 



(A1.4) 



which yield, changing the origin for ip to the unstable point to conform with our notations 
{i.e. obtaining (p(t) = 2( am (tge^/^) + 7r/2)), for I{t) = R{p{t),q{t)),p{t) = S{p{t),q{t)): 



. S_ _ 
cn(m, fc')' 2 



S . sn {lu, k ) , ^, 

/. cos-=z — ;. ' (A1.5) 

cn{iu,k') 2 cn{iu,k') 



Setting p = e^f '*, q = x{k')e^''^, see [GR], and using R{p, q) = 9j{—pdp + qdq)S{p, q) to 
evaluate S from R, the quoted basic relations between elliptic integrals imply immediately 
that the I{t) = R{p{t),q{t)),(p{t) = S{p{t),q{t)), solve the pendulum equations if: 



R{p,q) 



S{p, q) =2 arctgp - arctgr^ - ^(-1)"1+| 



(A1.6) 



sm 



cos 



Sjp.q) _gj 



1+r 



Sn=l( 1)" i+i^2n-i (P^ 



S{p,q) — _ gj_ 

2 2g 



with X = pq. Note that gj depends on x, and so do k', k: hence the coefficients of 

the first and of the last two of (A1.6) are also functions of a; = pq. Furthermore the 
(dimensionless) energy e becomes a function of = pq defined by inverting the map: 



^ = x{k') = x{{i - s-^y/^) 
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{A1.7) 



and the point corresponding to <^ = tt and to a dimensionless energy e, has coordinates: 



p = 1, q = x{k') 



(A1.8) 



(a rearrangement of (A1.6) showing convergence for p = 1 and |a;| < 1 is exhibited below). 
The variables {p. q) defined above are nice and natural: however they arc not canonically 
conjugated to (/, ip): the jacobian determinant of the map (p, q) < — > (/, (p) is not 1. But 
the jacobian determinant must be a function D{x) = g'lj''^^ of x alone {i.e. of the product 

-1 _ „-l2g^deix) _ /|„<fel/2 



pq); then (Al.8) and the equations of motion imply that D{x) = Qj 



dx ^ dx ' 

Therefore one can modify the variables p, q into new variables {pj^ qj) = {pF{x), qF{x)) 

^^^^ - ^^^D{x), which is D{x) ■ d,{xF\x)), is 

X / 



with F such that the jacobian g^-j ^ 

identically 1. One finds: F{x) = (4g)^/^( 

To invert the map {pj, qj) 
then: p = pjG{xj) and q = qjG{xj), x 



{pF{x),qF{x)) define xj'^=^ pjQj and G(xj) =^>(x)- 
xjGHxj) 



The final result is a local canonical 
map between Jacobi's coordinates {pj,qj) and global {I, if) coordinates: 



I = R{pjG{xj), qjG{xj)), <p = S{pjG{xj), qjG{xj)) 



(A1.9) 



where R, S are defined above, see (A1.6) which are written in a form easily recognized in 
the elliptic functions tables. The functions R, S can be rewritten in the following form: 



X q 



R{p,q)=-4g [E'^=o{tt9^- 
S{p,q) =4 [Em=o(arctgx"'p- arctgx^g)] 



sm 



cos 



S(p,g) ^ 2gj(x) 

2 g 

Sip,q) _ gj{x) 

2 2g 



i-2E™=o(-ir( 



p^ l+x 
X p' 



X q 

— jsr: 



l_|_j;2mp2 



+ 



{ALIO) 



exhibiting some of the properties of the Jacobi map in a better way. 

One checks that in the {pj, qj) variables the pendulum hamiltonian, (Al.l) has become 
a function J{pjqj) = 2g^ + gxj + 0{x^). The domain of definition of the map is given by 
the properties of the elliptic functions or, more restrictively, by the domain of convergence 

of the above scries. It inludes a disk of some radius pj > around the origin. 
The important symmetry R{p,q) = R{q,p) and S{p,q) = ~S{q,p) is manifest. 

Appendix A2. Filling times of quasi periodic motions. 

Let (wi, . . . ,Ud) = uj_ & R'^ he such that ■ < C\v\'' . Let x(a;),X±(a;) be C°°- 

functions even and strictly positive for |a;| < ^tt, vanishing elsewhere and with integral 



V^q)/! w I) ■x(e" 



1. Let ±^ e T'^ and 21(3^) =^£-(''-i)x(w • {± 

= orthogonal projection on the plane orthogonal to w . 
The function x can be naturally regarded as defined and periodic on T'^: if x{'^) 
is the Fourier transform of x as a function on R then the Fourier transform of x is 
xC'^'Oxls^l id """!)) H integer components vector, j/H = w • w |, v-^ = P-^v. The aver- 
age T Jq x{wt)dtis: 



X = 1+Y: x{E)e-'^o-^^'—Z_l > 1 - ^ 5: |x(^^ll)x(^k^|) 



{A2.1) 



Since the last sum is bounded above by be ^"^^"^ the average X is positive for all 
if T > 26Cff-(^+'^-i). This means that for T > 2bCs-^^+'^-^^ +7r/|a;|, hence for 
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T > BCe~^'^'^'^~'^^ with B a suitable constant depending only on d, the torus will have 
been filled by the trajectory of any point within a distance e. This proof is taken from 
(5), p. Ill, of [Gl], see [BGL] for an alternative proof. 

References 

[A] Arnold, V.: Instability of dynamical systems with several degrees of freedom, Sov. Math- 
ematical Dokl., 5, 581-585, 1966. 
[Be] Bessi, U.: it An approach to Arnold's diffusion through the Calculus of Variations, 

Nonlinear Analysis, 1995. 
[Br] Bernard, P.: Perturbation d'un hamiltonien partiellement hyperbolique, C.R. Academic 
des Sciences de Paris, 323, I, 189-194, 1996. 
[BGL] Bourgain, J., Golze, ., Lochak, P.: , preprint, in print in J. Statistical Physics. 
[CG] Chierchia, L., Gallavotti, G.: Drift and diffusion in phase space, Annales de 1' Institut 
Poincare, B, 60, 1-144, 1994. 
[E] Eliasson, L.H.: Absolutely convergent series expansions for quasi-periodic motions. Ma- 
thematical Physics Elctronic Journal, MPJE, 2, 1996. 
[Gl] Gallavotti, G.: The elements of mechanics. Springer, 1983. 

[G2] Gallavotti, G.: Twistless KAM tori, Communications in Mathematical Physics 164, 
145-156, (1994). 

[Ge] Gentile, G. : A proof of existence of whiskered tori with quasi flat homoclinic intersections 

in a class of almost integrable systems. Forum Mathematicum, 7, 709-753, 1995. See 

also: Whiskered tori with prefixed frequencies and Lyapunov spectrum, Dynamics and 

Stability of Systems, 10, 269-308, 1995. 
[GG] G. Gallavotti, G. Gentile: Majorant series convergence for twistless KAM tori, Ergodic 

theory and dynamical systems 15, 857-869, (1995). 
[GGM] G. Gallavotti, G. Gentile, V. Mastropietro: Pendulum: separatrix splitting, preprint, 

chao-dyn@xyz. lanl. gov 9709004. And G. Gallavotti, G. Gentile, V. Mastropietro: 

Arnold's diffusion is anisochronous systems, in preparation. 
[GR] Gradshteyn, I.S., Ryzhik, I.M.: Table of integrals series and products. Academic Press, 

1965. 

[L] Lochak, P.: Arnold's diffusion: a compendium of remarks and questions. Proceedings 
of 3DHAM, s'Agaro, 1995, in print. 

[P] Perfetti, P.: Fixed point theorems in the Arnol'd model about instability of the action- 
variables in phase space, mp_arc@math.utexas.edu, #97-478, 1997, in print in Discrete 
and continuous dynamical systems. 

[T] Thirring, W.: Course in Mathematical Physics, vol. 1, p. 133, Springer, Wien, 1983. 



Internet: Author's preprints downloadable (latest version) at: 





tittp : / / chimera . romal . inf n . it 




http: 


/ / www . math . rutger s . edu/~giovanni 



Mathematical Physics Preprints (mirror) pages, 
e-mail: giovanni@ipparco. romal .infn.it 

Archived also in: 

mp_arc@math.utexas.edu # 97-??? and 
chao-dyn@xyz.lanl.gov # 9709011 



13 



